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In this paper, we propose a covariate-adjusted nonlinear regres- 
sion model. In this model, both the response and predictors can only 
be observed after being distorted by some multiplicative factors. Be- 
cause of nonlinearity, existing methods for the linear setting cannot 
be directly employed. To attack this problem, we propose estimating 
the distorting functions by nonparametrically regressing the predic- 
tors and response on the distorting covariate; then, nonlinear least 
squares estimators for the parameters are obtained using the esti- 
mated response and predictors. Root n-consistency and asymptotic 
normality are established. However, the limiting variance has a very 
complex structure with several unknown components, and confidence 
regions based on normal approximation are not efficient. Empirical 
likelihood-based confidence regions are proposed, and their accuracy 
is also verified due to its self-scale invariance. Furthermore, unlike the 
common results derived from the profile methods, even when plug-in 
estimates are used for the infinite-dimensional nuisance parameters 
(distorting functions), the limit of empirical likelihood ratio is still 
chi-squared distributed. This property eases the construction of the 
empirical likelihood-based confidence regions. A simulation study is 
carried out to assess the finite sample performance of the proposed 
estimators and confidence regions. We apply our method to study the 
relationship between glomerular filtration rate and serum creatinine. 
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1. Introduction. Consider the covariate-adjusted nonlinear regression model 



where Y is an unobservable response, X = {Xi, . . . , XqY is an unobservable 
predictor vector, /3 is the unknown p x 1 vector parameter from a compact 
parameter space C i?^, f{X,f3) is a given continuous function in X £ 
and /3 S Y and Xr are the actual observable confounding variables and 
tp{U) and (priU) are the unknown distorting functions of observable vari- 
able U. In this paper, we study point estimation and confidence region con- 
struction for the parameter f3. When /(•) is linear in (3, say, f{X,(3) = 
/3o + Ef=iA^«, Sentiirk and Miiller (2005) and Sentiirk and Miiller (2006), 
respectively, studied the consistency and asymptotic normality of the es- 
timators and hypothesis testing. The model was motivated by an analysis 
of the regression of plasma fibrinogen concentration as response on serum 
transferrin level as predictor for 69 haemodialysis patients [see the details 
in Sentiirk and Miiller (2005)]. 

An example where a nonlinear model is relevant is the use of serum crea- 
tinine (SCr) to estimate glomerular filtration rate (GFR). GFR is tradition- 
ally considered the best overall index of renal function in health and dis- 
ease. GFR can be measured by clearance techniques involving endogenous 
filtration markers (e.g., creatinine and urea) or exogeneous markers (e.g., 
inulin, iohexol and iothalamate) . Clearance studies of exogenous markers 
have been identified as the valid approach to measuring GFR across the 
spectrum of renal function. However, owing to the difficulty, complexity 
and expenses associated with measurement of GFR using clearance studies 
in clinical practice, clinicians traditionally estimate GFR from serum cre- 
atinine concentration. Because of the nonlinear relationship between GFR 
and SCr, numerous equations using different transformations of SCr have 
been developed, the most well-known being derived from the Modification 
of Diet in Renal Disease (MDRD) Study [see, e.g., Levey et al. (1999)]. 
It is well known that GFR and SCr are also related to body surface area 
[BSA(m2) = 0.007184 * Kg°-^^^ * cm°-^^^]. Despite the nonlinear relationship 
between GFR and BSA, the adjusted GFR is usually calculated by simply 
dividing the unadjusted GFR over 1.73 times the BSA. The relationship 
between SCr and BSA is also nonlinear. Therefore, correcting the nonlinear 
effects of BSA on GFR and SCr may improve the estimated relationship 
between GFR and SCr, thereby leading to better estimation of GFR from 
SCr. As an illustration, we consider the data collected from the study A 
and study B of the MDRD study [see, e.g., Rosman et al. (1984) and Levey 
et al. (1994)]. Here, the relationship between GFR and SCr is of particular 
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interest. We aim to show that the estimated relationship between GFR and 
SCr is more distinct after correcting for the distorting effect of BSA. Because 
the relationship of GFR and SCr is nonlinear, the covariate-adjusted linear 
regression model proposed by Sentiirk and Miiller (2005) cannot be used in 
our application. We will return to the example in Section 5. 

To estimate the parameters in such nonlinear models, a natural consider- 
ation is whether the method proposed by Sentiirk and Miiller (2005) in the 
linear covariate-adjusted regression can be extended to the nonlinear setting. 
The basic idea of their method is to transform it into a varying-coefficient 
regression model, say, 

(2) Y = aoiU)+Y,aiiU)Xi + ijiU)e, 

1=1 

where aoiU) = MiU), ai{U) = f3i^, I < I < p. Note that /?o = lE[ao(C/)] 



and Pi = ^i2l^!^)2^lL ^ 1 < / < p. Then, the bin method similar to that proposed 

in Fan and Zhang (2000) for longitudinal data is used to obtain consistent 
estimators a/, < Z < p. The estimators were further proved to be asymp- 
totically normal by Sentiirk and Miiller (2006). The estimators of /3o and 
Pi can be obtained by replacing all the expectations by the corresponding 
weighted averages and ai by d;. 

For model (1), when the nonlinear regression function f{X,(3) has a 
particular structure such as f{X,P) = f{(3Q, (3iXi, . . . , (3qXg), we can let 
oti{U) = Pi/(j)i{U) and then transform the model to a nonlinear varying coef- 
ficient regression model to obtain estimators for a/. The relevant references 
are, among others [Staniswalis (2006) and Fan, Lin and Zhou (2006)]. The 

estimators for di can be defined by using the fact di = ^1^jMD2^A_ However, 

when / is also related to X^' for some given ai, a similar approach cannot 
make f3i estimable unless the expectations of <j)i{U)"''' are given constants. 
When ai are also unknown parameters, we even have a problem to iden- 
tify the parameters f3i and ai. Hence, it is obvious that, for more general 
models, we may not expect a useful, straightforward extension of this trans- 
formation method. This observation motivates us to develop a new approach 
to handle nonlinear models. In this paper, we consider a direct estimation 
procedure. We first use nonparametric regression to obtain consistent esti- 
mators of the distorting functions and 4>r{') by regressing the response 
and predictors on the distorting covariate, respectively, and obtain the es- 
timates (Y , Xi, . . . ,Xq) for the unobservable response and predictors. We 
then apply least squares method to obtain the estimates of P in terms of 
{Y,Xi, . . . ,Xq). Note that the step of estimating the distorting functions is 
independent of the step of estimating f3. Thus, this method is obviously able 
to obtain a nonlinear least squares estimator of f3 for model (1). 
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There are two issues that need to be taken into account when the above 
procedure is apphed. Ahhough the above estimation procedure in principle 
is apphcable, the root n-consistency and asymptotic normahty are more dif- 
ficult to study than the method suggested by Sentiirk and Miiller (2005) for 
linear models. This is due to the fact that nonlinear least squares estimator 
does not have a closed form and nonparametric estimators of the distort- 
ing functions have slower convergence rate than root n. We propose using 
the Delta method and J7-statistics theory to obtain the asymptotic normal- 
ity. The details are given in the Appendix. However, even though we can 
obtain the root n-consistency and asymptotic normality, the results in Sec- 
tion 2 show that the limiting variance of the estimator has a very complex 
structure; therefore, it is inconvenient to construct confidence region based 
on normal approximation. Hence, we propose using an empirical likelihood- 
based confidence region that avoids estimating the limiting variance and has 
better accuracy. Many advantages of empirical likelihood over the normal 
approximation-based method have been shown in the literature. In partic- 
ular, it does not impose any prior constraints on the shape of the region, 
does not require the construction of a pivotal quantity and is range pre- 
serving and transformation respecting [see Hall and La Scala (1990)]. Owen 
(1991) applied the empirical likelihood to a linear regression model and 
proved that the empirical loglikelihood ratio is, asymptotically, a standard 
X^-variable. Owen (2001) is a fairly comprehensive reference. Using empir- 
ical likelihood seems routine. However, a somewhat surprising result about 
our proposed method is that, although plug- in nonparametric estimators 
are used for infinite-dimensional nuisance distorting functions, the empir- 
ical likelihood ratio for the regression parameters is still of limit. This 
is very different from the existing literatures, because when nonparametric 
nuisance functions are replaced by plug-in estimators, the limit is often not 
tractable Chi-squares anymore, unless bias correction is implemented [see, 
e.g., Xue and Zhu (2007) and Zhu and Xue (2006) for details]. 

The rest of the paper is organized as follows. In Section 2, we describe the 
estimation procedure and the associated asymptotic results and discuss the 
efficiency of the estimators. In Section 3, we construct empirical likelihood 
based confidence regions for the parameters. In Section 4, some simulations 
are carried out to assess the performance of the proposed estimators and 
confidence regions. The application to the GFR and SCr data is presented 
in Section 5. Section 6 contains some concluding remarks. The technical 
proofs of all the asymptotic results are provided in the Appendix. 

2. Point estimation and asymptotic behavior. As in Sentiirk and Miiller 
(2005), we assume that the mean distorting effect vanishes, that is, 
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Besides, other basic assumptions are that 

(b) {Xr,U,e) are mutually independent, 0^,U) are independent, 

(c) E[e] = 0, Var(e)=cr2. 

Assume that the available data are of the form {(Ui,Xi,Yi), 1 < i < n}, 
for a sample of size n, where Xi = (Xu, . . . ,XqiY are the g-dimensional 

observed predictors. The unobservable versions of {Xi,Yi) are denoted by 
(Xi,Yi). We write model (1) in a sample form i = 1, . . . ,n, 

(Yi = f{Xi,f3)+e„ 
(3) lYi = ^{Ui)Yi, 

[ Xri = 4>r{Ui)Xri, r = l,...,q, 

where {e^,! < i < n} are independent and identically distributed random 
errors. ^ ^ 

Now, our objective, based on the observations {{Ui, Xi,Yi), I <i < n}, is 
to estimate the unknown parameter vector (3. Prom assumption (b). 

For convenience, we denote the density function of U by p{U) and define 
gY{U)=E[Y\U]piU), 

(5) 

gr(,U)=E[Xr\U]p{U), l<r<q. 

There are some existing methods of estimating ^(f/) and (priU) [for de- 
tails, see Eagleson and Miiller (1997)]. Herein, we adopt a kernel method 
commonly used for ease of exposition. Then, for 1 <r <q, 

J ^ l/{nh)Y.UK{{u-U.i)/h)Y, \_ ^ gviul \_ 

^ l/{nh)Y.UK{{u-Ui)/h) f p{u) r 

(6) 

: ^ l/{nh)Y.UK{{n-Ui)/h)Xr^ 1 ^gM 1 

^''^^ l/{nh)EUK{{u-U.)/h) 'x^ p{u) 1/ 

where Y = \ Yll=i^i: = ^Yl7=i^ri, h is a, bandwidth, and K{-) is a 
kernel function. 
Let 

(7) Y, = YS{Ui), Xr^ = Xr^/MU^) and Xi = {Xu, . . . , Xg,Y . 

Then, the nonlinear least squares estimator (3 for model (1) can be defined 
as the solution of the p equations 

(8) Gt{P) = t{Y.-f{X.,f3))^i^^ 
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for k = 1, . . . ,p, where df{-,P)/d(3k is the partial derivative of / with respect 
to f3k- Because / is nonhnear in f3, no expHcit solution can be calculated, 
and an iterative procedure is needed instead. 

Observe that the asymptotic properties of (3 defined by (8) depend on 
those of estimating functions G^(/3). An approximation to G^(/9) is provided 
in the following proposition, which is the key to obtaining the asymptotics 
of (3 later on. The detailed proof is given in the Appendix. 

Proposition 1. Under conditions (Al) and (A5)-(A8) in the Appendix, 
we have, for 1< k <p, 

(9) n~'G^M = n-'RtiP) + op{n~^/^) 
and 

(10) n-^GnmGl{(3) = n-^RMKi^) + opil), 

where Gnif3) = iGii(3),...,GPif3)r, Rni(3) = (RUO) , . . . , RUPW ^nd 

n 

Ri{P)=Y,edp,{X,,P) 

i=l 

E[Xif,^{X,(3)fp^{X,(3)] \ 

where fp^, , fxi denote the first derivative of f with respect to Pk o.'i^d Xr , 
respectively, l<k<p, l<r <q. 

Remark 1. When specialized to the covariate-adjusted linear regression 
model, f{X, 13) = (3q + X]f=i Pi^i- The estimating equation Rn{f3) is 



Ri{P) = Y,eiXk^ 



i=l 



4|:(«-i>-i.l^-|:«.^..-m.^) 
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and, for Pq, Xq is equal to 1. 

Root n-consistency and asymptotic normality of ^ defined by G'^{/3) will 
be established in the following theorem, and the conditions required are given 
in the Appendix. Here, we introduce the following notation {1 <s,k <p)- 

A{s,k)=K[ff,^{X,(3^)ff,^{X,p% 

Ck=nft3,{^,p% c=(Ci,...,Cpr, 

(11) Vk = , v = im,---,Vp) , 



U^i-nx,]) 

E[y/^,(X,/30)] 



x{{Y-E[Y])- 



E[Y] 

E[Xif,^{X,(3')f^^{X,f3^)] 



E[Y] 

E[Xif,,{X,(3')fp^{X,(3')]' 



1=1 ; 

X-(Y ^, E[XiUX,f3'^)f^^{X,(3°)] \\ 

Theorem 1. Let (3 be defined by (8). If conditions (Al)-(A8) in the 
Appendix are satisfied, the following results hold: 

(i) (3 converges in probability to the true value [3^ ; 
(ii) 

(12) ^^^_f3P)^N{Q,^), 
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where the covariance matrix can be represented as sum o/S = cr^A + 
B + C with 

C = a2A-i(lr?r + iC^-)A-i. 

The proof is deferred to the Appendix. Let us consider the terms in the 
expression for the asymptotic covariance matrix in Theorem 1. If there are 
no distortions with tjj = cp^. = 1, and Y = Y and Xr = Xr, then we can es- 
timate (3 by least squares, minimizing ^27=10^1 ~ fi^i^P))'^ with respect 
to /3 € O. The term cr^A~^ equals the asymptotic covariance matrix of this 
least squares estimator. The matrix A + B is caused by the distortions. The 
correlation between the first two terms of RniP) leads to C. 

Remark 2. When the model is linear, the limiting variance matrix is 
reduced to 



(13) 



a^A~^ + A + B + 



2E[y] 



where (0 < s. A; < p and Xq = 1) 
A(s,A;)=E[X,Xfc], 



1 



A{s,k) = -B'',0lM 



Y - E[y] X, - E[X,] \(Y- E[y] Xfc - E[Xfc] 



B{s,k) = 0i0iR 



E[y] 

Y-Y X, 



X, 



E[y] 

eo = (l,0,...,0)^ 



E[X, 



E[X,] A W\ 

Y-Y Xk-Xk 
~W] E[X,.] 



E[Xfc] 



Then, the asymptotic variance of y/n{j3k — fi^) obtained from our new method 
is given by, < /c < p 



(VI) : 



a2 = c72(A-i)oo + ^/?o°'lE 
al = a\A-%u + \pf^ 



Y-Y 



y-E[y]y 

y-E[y] Xk-nXkV^ 



+ 



E[y]^o' 



+ /3fE 



E[y] 

Y-Y Xk-Xk^^ 



mx. 



E[y] 



nxk 



Now, we are in the position to make a comparison between the naive esti- 
mators (iTq, af,) and the transformation-based estimators ((Tq, erf,)- According 
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to the results of Sentiirk and Miiller (2006), the asymptotic variance of the 
transformation-based estimators is (with the correction of a typographical 
error in their paper) 



(V2): 



irl = a\A-^)oo + ^H^~')oo Var[^(i7)] + /J" Var[V(C/)], 
al = a\A~\k + a2(A-i)fcfc VariV'Cf/)] 



The following result states a necessary and sufficient condition to judge when 
the naive estimators are more efficient with smaller limiting variance for 
this case. To formulate this result, we write A as A = (Aq, Ai, . . . , Ap), with 
Afc being (p + l)-dimensional column vector. Let = (0, . . . , 0, 1, 0, . . . , 0)"^, 
which is of p + 1 dimension and only (fe + l)th element is 1. 

Theorem 2. Treating as (/)o{-) = 1 and Xq = 1, then af. < af,, if and 
only if, 

(14) {/3:/3"Cfe/3<0,0<fc<p}, 
where the matrix Ck is 

-4a2Var[V'(f/)](A-i)fe4AoA5 



+ a\-3 + A¥.[^^{U)])ekel + 2a\ekAl + Akel)I{k = 0) 
+ (3-4E[V'(C/).^fc(C/)])^/3|(AfcA5 + AoA^) 

Furthermore, the matrices 's are symmetric with at least one negative 
eigenvalue. 

Prom this result, we realize that the limiting variance of the naive esti- 
mator relies on all the coefficients f3k', thus, no one can be uniformly more 
efficient than the other, in general. However, in the case without distortion, 
our method would perform worse than the transformation based method. 
The following examples show what the sufficient and necessary conditions 
for the naive estimator that has smaller limiting variance will reduce to, 
assuming that at least one variable in the covariate-adjusted linear model is 
contaminated. 
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Example 2.1. Consider the simplest covariate-adjusted linear model 
with p= I. Then, 

(2.12) }y = ^P{U)Y, 

[x = cPiU)X. 

The condition of Theorem 2 is 

R = {(/?o, A) : (/3o, /3i)Cfc(/3o, f3iY < 0, < < 1} 

with the elements of matrices C^s defined by 

C„(M) = .^(5-4V»wffi|!). 

Co(l, 2) = Co(2, 1) = a^E[X] 2 - 4 Var[^] 



Var[X] 

Co(2,2) =/32Var[X](4Var[V'] + 1), 

Ci(l, 1) = /3fi-3 - 4E(V^2) ^ 8]E(^^))^^ _ y^^^^j 



E[X])2 ^^^Var[X]' 



Ci(l, 2) = Ci(2, 1) = -4/3i^(E[V'1 - E[V<^])^;^ - 4a^ Var[V^] 
Ci(2,2) =(72(^1 -4Var['0] 



E[X] '^'Var[X]' 

(E[X])2^ 



Var[X] 

Unfortunately, the matrices C^s are rather complex and depend on the 
following many quantities: E[X], E[X2], Yav[ip{U)], K[^{U)(p{U)], and 
the unknown parameters Pq and Pi. It is not easy to directly imagine the 
region R. Figure 1 is helpful to understand the performance of two meth- 
ods under the model assumptions that E[X] = 2, E[X'^] = 5.144, = 0.25, 
Var['0(C/)] = 0.08 and E[ij{U)(l){U)] = 1. 



3. Empirical likelihood-based confidence region. In this section, we dis- 
cuss confidence region construction. Because we have obtained the asymp- 
totic normality in the above section, normal approximation is a natural 
approach. However, as is shown in the theorem, the matrix S is rather com- 
plex and includes several unknown components to be estimated. If we use 
normal approximation to construct a confidence region for /3, we need a 
plug-in estimation that involves the estimation for many unknown compo- 
nents. For comparison with the following empirical likelihood, we will still 
present normal approximation in the next section. 
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Fig. 1. Region Rl corresponds to the case where the direct method performs better; R2 to 
that where the direct method performs better only for /3i ; R3 to that where the direct method 
performs better only for /3o ; R4 to that where Sentiirk and Miiller's method performs better. 



To construct a confidence region for (3, we first introduce an auxiliary ran- 
dom variable e^,(/3) = (y, - /3))^^g^ and^AP) = (^.C/^), • • • , C.(/3)r 
Note that E[^„^j(/3)] = if /3 = /j". Using this, an empirical log-likehhood ra- 
tio function is defined as 

( n n n 

IniP) = -2max<^ ^log(npi) :pi > 0,^pi = l,^piCn,i{P) = f- 



i=l i=l 



Since Yi and Xi in ln{P) are unobservable, a natural method is to replace 
them by the estimates Yi and Xi, respectively. A plug- in empirical log- 
likehhood, say is 

f n n n ^ 

(15) /(/3) = -2maxi ^log(np,) > 0,^p, = 1, ^p,G„,i(/3) = , 



.i=l 



i=l 1=1 



where G„,(/3) = {Gl^iP), . . ■ ,G^,,(/3))- and = {%- f{X,,f3)f-M§fl. 

A; = 1, ... ,p. If we apply the standard method of Lagrange multipliers to find 
the optimal pi in l{f3), the log empirical likelihood ratio is 



(16) 



f(/3) = 2^1og{l + A^G„,,(/3)}, 



i=l 
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where A is determined by 



1 



0. 



Theorem 3. Assume that conditions (Al) and (A5)-(A8) hold. /3° de- 
notes the true parameter value. Then, 

(18) k(3')^xl. 

Based on the above theorem, an empirical Hkehhood confidence region for 
f3 with nominal confidence level a is 

(19) Ia,EL = {f3:m<Ca}, 



a. 



where Cq satisfies P{Xp < Cq,) = 1 

We note that, although the estimated Yi and Xi that involve the estima- 
tion for infinite-dimensional nuisance parameters (distorting functions) are 
used, the Wilks' theorem still holds. This is very different from all of the 
existing results that need plug- in estimation for nuisance functions, unless 
bias correction is used. Xue and Zhu (2007) and Zhu and Xue (2006) show 
the necessity of bias correction in the relevant settings. This result is of 
importance for the use of empirical likelihood in semiparametric problems, 
because it tells us that we may not be able to give a unified theorem to deal 
with infinite-dimensional nuisance parameters. It merits further study. 

We now consider normal approximation for constructing a confidence re- 
gion for (3. As we mentioned before, we have to estimate the unknowns in 
S given in Theorem 1. Plugging the consistent estimators o"^. A, 57, F and 
ft into S, we can obtain an estimator S of S. A confidence region with the 
approximate nominal coverage 1 — a can be provided as 

/a,asy = W : n(^ - (3Yt~\^ - (3) < C„}. 

4. Simulation study. In this section, we carry out simulations to inves- 
tigate the performance of our proposed methods as outlined in Sections 2 
and 3. Here, we choose a higher order kernel function K{t) = 11(3 — 7t^)(l — 
t^)I{\A ^ 1) ^iiid use the leave-one-out cross-validation to select the optimal 
estimated bandwidth. 



Example 4.1. Consider the nonlinear regression model 
(20) y = /3i(l-exp(-/32X)) + e, 

where /3i = 4.309, /?2 = 0.208, e ~ iV(0, 2.0639) and X is drawn from a nor- 
mal distribution ~ A^(6.8,26) truncated in interval [0.45,25]. Assume that 
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Fig. 2. Confidence regions for Example 4-1- The solid and dash-dotted lines correspond 
to the empirical likelihood method and normal approximation method, respectively. The 
point "+ " is the true value. 



the distortion variable U ~ A^(0,6) truncated in [0,6], and the distortion 
functions are tp{U) = {U + 1)2/27.8170 and (/>([/) =JU + l)/4.9459, which 
satisfy the identity conditions. Then, 500 samples of Y and X were simulated 
from the specified distributions with sample sizes 200, 400 and 600. The es- 
timated mean-squared errors for the estimators of Pi and P2 sue (0.1983, 
0.0717, 0.0315), (0.0022, 0.0006, 0.0006), respectively, for the sample sizes 
n = 200, 400, 600. 

Figure 2 reports a comparison of the two methods for constructing con- 
fidence regions, which are normal approximation and empirical likelihood, 
where the sample size is 400. The empirical likelihood provides a comparable 
confidence region as that of normal approximation. The coverage probabil- 
ities are reported in Table 1. We can see that the coverage probabilities of 
the empirical likelihood are uniformly higher than those of the normal ap- 
proximation, which approach the nominal level as n increases. Thus, in this 
example, the empirical likelihood is clearly superior to the normal approxi- 
mation. 



Example 4.2. 
sion model is 

(21) 



Assume that the underlying unobserved multiple regres- 

y = /3i(l + X)^2 +e, 
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where X arises from a uniform distribution with expectation 7/3 and vari- 
ance 19/12, /3i = 2.5, (32 = —1 and e ~ A^(0,4). The confounding covariate 
U was simulated from Un(4 — \/7, 4 + \/7), and the distortion functions were 
chosen as 7p{U) = {U + 10)Vl94.9160 and = {U + 34)/37.9160, which 
satisfy the identifiabihty conditions. Again, we conducted 500 simulation 
runs with sample sizes 200, 400 and 600. The estimated mean-squared er- 
rors for the estimators of pi and (32 are (0.0121, 0.0054, 0.0038), (0.0439, 
0.0126, 0.0123), respectively, for the sample sizes n = 200,400 and 600. The 
confidence regions, constructed by two methods involving empirical likeli- 
hood method and normal approximation method, are shown in Figure 3 with 
the sample size being 400. We can see that the region based on the normal 
approximation is larger with the slightly lower coverage probabilities than 
those of the empirical likelihood, which are reported in Table 2. Thus, again, 
the empirical likelihood performs better. 

5. Application. As an illustration of our method, we apply it to the 
baseline data collected from studies A and B of the Modification of Diet 
in Renal Disease (MDRD) Study [see Rosman et al. (1984) and Levey et al. 
(1994)]. The main goal of the original study was to demonstrate that dietary 
protein restriction can slow down the decline of the glomerular filtration 
rate (GFR). Here, we use the baseline unadjusted glomerular filtration rate 
(GFR) and serum creatinine (SCr) data to show that the relationship of the 



Table 1 

For Example 4-1, the coverage probabilities of the 
confidence regions on (/3i,/32)^ 







95% 




Method 


n = 200 


n = 400 


n = 600 


Empirical likelihood 
Normal approximation 


0.9140 
0.8520 


0.9260 
0.8840 


0.9360 
0.9340 


Table 2 
For Example 4-2, the coverage 
confidence regions on 


probabilities 


of the 






95% 




Method 


n = 200 


n = 400 


n = 600 


Empirical likelihood 
Normal approximation 


0.9340 
0.9340 


0.9360 
0.9260 


0.9420 
0.9360 
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E-LSE 




e-ELR 
true value 



1.8 2 2.2 2.4 2.6 2.8 3 3.2 

Fig. 3. Confidence regions for Example 4-2. The solid and dash-dotted lines correspond 
to the empirical likelihood method and normal approximation method, respectively. The 
point "+ " is the true value. 

two variables can be substantially improved by correcting for the distorting 
effect of body surface area [BSA: BSA{m^) = 0.007184 * Kg^-^^s ^ cm^-^^Sj^ 
After excluding a few missing data and couple outliers, the data set includes 
819 subjects. Figure 4 depicts these original data. The main outcome in this 
example is the unadjusted GFR, which is measured as mL/min. 

For these data, let the observable response Y be the unadjusted GFR and 
the observable predictor X be the serum SCr. The consideration that the 
distortions of the response and predictor are unknown together with the fact 
that the relationship of GFR and SCr is nonlinear [see Levey et al. (1999) 
and Andrew et al. (2004)] motivates us to fit the serum creatinine data by 
a nonlinear model where the confounding variable U is taken to be BSA. A 
relevant research [Ma et al. (2006)] showed that a nonlinear model for such 
kind of data is needed. Figure 5 presents the scatter plot of the logarithm of 
GFR against SCr, which shows a quadratic curve. Thus, we used a second 
order polynomial of SCr to fit an exponential model 

(22) f{X,P)=P^eM-p2X-/3sX^) + (3^. 

A similar model, based on different transformations of SCr where they 
simply divided the unadjusted GFR by 1.73 times of BSA, was used in 
Levey et al. (1999). In this paper, instead of calculating the adjusted GFR 
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Fig. 4. The scatter plot based on the data of original GFR and original SCr. 
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Fig. 5. The scatter plot of logarithm of adjusted GFR against adjusted SCr. 



by dividing it by 1.73 times the BSA, as is commonly done in medical lit- 
erature, we allow the distorting function to be nonlinear and estimate it 
by kernel smoothing. The distorting function between SCr and BSA is also 
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estimated using kernel smoothing. The kernel and bandwidth selector cho- 
sen are the same as those in Section 4. The increasing patterns of tp{u) 
and (/>(ti), which are displayed in Figures 6 and 7, provide evidence that 
ipiu) and (j){u) are not constant for these data. Therefore, the covariate- 
adjusted regression model is preferred over the existing methods that divide 
the GFR by 1.73 times the BSA. The restored data are then obtained by 
(7). We set the starting value of /3 as (1,0,0,1)'^ and the final estimate is 
^ = (10.7100,0.0759,-0.0004,1.1528)^. The adjusted data using our pro- 
posed method and the nonlinear fitted curve are shown in Figure 8. After 
corrected for the BSA effects on both the response and predictor, the scatter 
plot shows that the adjusted data are closer to the fitted nonlinear curve, 
suggesting a more distinct relationship between GFR and SCr. This leads 
to a more accurate estimate of GFR using SCr. 

6. Concluding remarks. For the models with distorting functions in re- 
sponse and predictors, we have proposed a direct approach to estimate and 
to make inference on the parameters of interest. For these models, there are 
several interesting issues that merit further studies. We recognize a merit in 
the original transformation method proposed by Sentiirk and Miiller (2005), 
which does not require divisions of the distorted variables by the estimated 
distorting functions, and the estimation may therefore be more numerically 
stable, compared to our proposed direct method, when some of the values 
of the distorting functions are closed to zero. How to inherit this merit in 
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Fig. 6. The kernel estimates based on the data of original GFR and BSA. 
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developing other more general estimation methods is an interesting topic. 
For the direct method, a practical remedy is to use their estimators plus a 
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small constant, say, 1/n. This is a commonly used remedy in nonparametric 
regression estimation [see, e.g., Zhu and Fang (1996)]. It will not damage 
their asymptotic properties. Another possible extension for the covariate- 
adjusted regression is to consider a model in which the distorted variables 
also have additive measurement errors. When the relationship between the 
underlying response and unobservable predictors is linear, it is possible to 
develop a method to estimate the parameters. However, the estimation may 
be a great challenge when the underlying relationship is nonlinear. Another 
related topic deserving further attention is on model checking when applying 
these methods to real data. 



APPENDIX 

The following conditions are needed for the results: 

(Al) For all si, 52,53,54 = 0,1,2, si + S2 + S3 + S4 < 3, 1 < ki,k2 < p, 
1 < ^ii '2 < the derivatives 



exist. 



and 



9^1+^2+^3+^4 



9^1 /3fc, d'-^xi^ d'^xi. 



E< sup 

I 13 



5^1+^2+^3+«4j(X,/3) 



when S3 + S4 > 1 



< 00, 



9^i/3fci d'2Pk2 d'-'xi, d'^xi^ 

when 1 < si + S2 < 2 and S3 + S4 = 0; 



(A2) E[e^] <oo; 

(A3) Suppose A(s,A:) =E[/^,(X,/30)/;3,(^,/3°)] <oo for s,k = l,...,p, 
and the matrix A is positive definite; 

(A4) S{i3,(3q) =E[/(A', /3) — /(X, /3o)]^ admits one unique minimum at 

/3 = /3o; 

(A5) All gr{u) = (j)r{u)p{u), 1 < r < q, gy^u) = ip{u)p{u), (priu), i^iu) and 
p{u) are greater than a positive constant, are differential, and their deriva- 
tives satisfy the condition that there exits a neighborhood of the origin, say 
A, and a constant c > such that, for any 6 £ A, 

\gi?\u + 5)-gP{u)\<c\5\, l<r<q, 
\g'^\u + 6)-g'^\u)\<c\6\, 



p^^\u + 6) - p^'^> {u)\ <c\6\; 



(3)/ 
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(A6) The continuous kernel function K{-) satisfies the following proper- 
ties: 

(al) the support of K{-) is the interval [—1, 1], 

(a2) K{-) is symmetric about zero, 

(a3) J\ K{u) du = 1, J\ u^K{u) du = Q,i = l, 2, 3; 

(A7) As n — > oo, the bandwidth h is in the range from 0(n~^/^logn) to 

(A8) E[y] and E[X^] are bounded away from 0, ^\Yf < oo and E[Xr]2 < 
oo. 

These conditions are mild and can be satisfied in most of circumstances. 
Conditions (A1)-(A4) are essential for the asymptotic results of nonlinear 
least squares estimation. Condition (A5) is related to smoothness of the 
functions 5r(") and gvi') and the density function p(-) of U. Conditions 
(A6)-(A7) are commonly assumed for the root n consistency of kernel based 
estimation [see, e.g., Zhu and Fang (1996) and Stoker and Hardle (1989)]. 
Condition (A8) is special for this problem see Sentiirk and Miiller (2006)] . 

We start with a lemma, which is frequently used in the process of the 
proof. Then, we proceed to the proof of Proposition 1 and Theorem 1, con- 
cerning the consistency and asymptotic normality. 

Lemma A.l. Let r]{x) be a continuous function satisfying E[r/(X)]^ < 
oo. Assume that conditions (A5)-(A8) hold. The following asymptotic rep- 
resentation holds: 

i=l ^ ' 

When replacing Yi,Yi,Yi with X^i, X^i and X^i, respectively, we can have 
similar asymptotic representation. 

Proof of Lemma A.l. Recall that % = Yi/ip{Ui), which is given by 
(7) and (6). Simple calculations give the expression 

n n n 

J2{Y, - Yi}rj{X,) = Y,YMX^) 

i=l 1=1 i=l 

Thus, it suffices to deal with the quantity J2i'=i^i'n{^i)- According to (6), 
it holds that 

n 
i=l 
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h \9YmJ 

(23) 



= Li + L2. 

First, we analyze Li. The proof is divided into three steps. Denote by 
Lii, Li2 and L13 the quantities 

(24) - - 

Step 1. Show that 

(25) Li = Lii-Li2 + Li3 + op(V^). 
Applying the equation '0(-) = 3\ , we have 



Li = Y,yMx^ 



^ * Ti/Tr.^ r,.. TT.\ \ r,.r TJ.\ 



where 



.^^ p(C/i) 5y(t^i) V <?y(C/^) p{Ui) 

±YMX.) -ty^r^iX.f^.+tYMX.f^^^ 

i=l i=l 

+ Lf 1 - Lf 2 

Ln - L12 + Li3 + Lf 1 - Lf ^ 



Lf^=Y,YMX^ 



i=l 



9Y{Ui)gY{Ui) 
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Equation (25) can be concluded by proving 

(26) Lf' =op(ni/2), l<i<2. 

Invoking Theorem 2.1.8 of Rao (1983), Lemma 3 of Zhu and Fang (1996), 
condition (A7) and the Law of Large Numbers (LLN) for ^ J2i=i ^^(^«), we 
then fohow the arguments used in Zhu and Fang (1996) to yield the higher 
order term 

Lf ^ = Op (^{h^ + n-'/^h'^ logn)2|f]y,r?(X0 

= op(nV2). 
Similar arguments yield (26) for i = 2. 



Step 2. Show that 

n 1 n 



1^ 1 



l/2^ 



Ln = ^Y.^^r|{Xi) + -Y,nYr^{X)]+op{n , 

1=1 1=1 

To analyze L12, we need to work on 

. gym 
arm- 

Again, applying the similar arguments used by Zhu and Fang (1996), we 
approximate the above sum by a [/-statistic with a varying kernel with the 
bandwidth h. We can then have the asymptotic representation [see, e.g., 
Serfling (1980)] as 



i=l 



9Y{Ui) 

-Y^-J YMX.)—^^K {-^J yi;{u)py,u{y, u) dy du 



+ I Y,i:{Ui)^K(\^\tpT,u{t,u)dtdu + op{n^l^) 

with PY,u{y, u) being the density function of {Y, U),T = Y7]{X) and PT,u{i-, u) 
the density of variable {T,U). Note that Y and U are independent, T and 
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U are independent. Invoking conditions (A5)-(A8), we obtain 

i=l 1=1 L J 

The result of (27) for L13 can be similarly proved. Combining (25) with (27), 
we have 

n 

Li = Y,iYir^iX,)-E[Yr]iX)]) 

i=l 

+ nE[Yr]{X)]+op{n^/^). 

Step 3. Show that 
(29) L, = ±iY.-E[Y])^^^l^+opiny^). 

i=l L J 

Prom (28) and the definition of L2 in (23), we derive that. 



Y-E[Y] 
- E[Y] 

= Y^iYMX.)-E[Yv{X)])- ^ J 



i=l 



E[Y] 

lv(r.-E[r|)Hi:^izJElIl 

2^^ ^ E[F] E[y] 



+ nE[yr?(X)]^^J^+op(nV2) 
E(?.-K[r])™ + oHnV^) 
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where the last equahty is obtained by applying LLN to ^-fjyp^i ^ J27=i0^i'n{^i 
E[Yv{X)]) andiEr=im-E[y]). 

Finally, together with the asymptotic representation of Li and L2 in (28) 
and (29), the desired result is easy to arrive at. □ 

Proof of Proposition 1. 

Proof OF (9). Start by writing G^(/3) =1)1 + 1)2 + ^^3 by using Taylor 
expansion, where 

" g/(X.,/3) " y. df{X,,(3) 

~ht} '^^^ 

V 9(^k fr{ d[3kdxi J 
D, = jz [e. + {% -Y.)-t ^^^^^(^^^ - ^'^)) 

where X** = {XI*, X**y and X* = {X^^, X^^Y with both X^* and 
Xi^ are two points between Xn and Xn. Applying Lemma A.l to the sec- 
ond term of Di with ri{X) = and to the third term with r](X) 
df{X,f3) df{X,f3) 



dxi ' 



we can conclude that 



+ Z^(^i - iuTT^ + op(Vn) 
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2^V ^ L J. jgJ^J 



E[y(5/(X,/3))/9/3fcj 



+ iJJi - ^i) + op(Vri), 

i=\ z=i ^ 'J 

" ^ ~ ^ , E[XKa/(X,/3))/9/?fc(a/(X,/3))/ax;] , 

+ !L z^(^«i - ^li) -^^^ + op(v?^). 

To accomplish the proof, we only need to show that both D2 and Dj, 
are asymptotically negligible. Note the arguments, which can be found in 
Zhu and Fang (1996), 

sup \p{u) — = 0{h^ + n~^l'^h^'^ logn), a.s., 

(30) 

sup I (n) — (7Y(ti) I = Op (/i^ + n ^I'^h ^logn). 

u 

Combining this with conditions (Al) and (A7)-(A8), we have 



= Op {h^ + n-^l^h-^ log n)2 E E E ^ 



n q q 



dl3k 

- Op ( (/.^ + n- log n)3 J ^ ^ ^ C n 



\ k«=l 1=1 r=l ) / 

= Op((/i^ + n~^/^h~^ logn)2n) + Op{{h^ + n^^'^h"^ lognfn) 
= opin'/^). 

Similarly, is op(n^/^). The desired result follows from above analysis. 
Proof of (10). Note that RniP) = Op{n^/^) and 

-G„(/3)G';(/3) 

n 

= -Rn{fi)K{P) + -(Gn(/3) - RnmK{f3) 

n n 

+ ^Rn{l3){Gn{l3) - Rn{f3)T + ^(G„(/3) - i?„(/3))(G„(/3) - i?n(/9))". 
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Combining this with (9), it can immediately lead to the desired result of 
(10). □ 

Proof of Theorem 1. 

Proof of (i). Denote by 5'„(/3) the quantity 5„(/3|{Xi, y^) = Er=i(^i- 
/(Xj,/3))^. The main point of the proof lies in stating that, for any (3 in G, 

Sn{P\{Xi^Yi\) — Sn{l3\{Xi^Yi}) = op{n). This implies, by the fact that /3 = 
argmin^ge S'„,(/3|{Xj, 1^}) and the arguments used in Wu (1981) under con- 
ditions (Al)-(A4), that Sn{f3\{Xi,Yi\) converges in probability to 5(/3,/3°), 
admitting a unique minimum at (3 = (3^ . The consistency of /3 follows from 
Lemma 1 of Wu (1981). Now, after simple calculations, we can obtain the 
decomposition 5„(/3|{l„ y^}) - S„(/3|{Xi, y,}) = Fi + Fa + F3 + F4 + F5, 
where 



Fl = $:(y,-y,)^ 

i=l 

n 

F2 = J2if{Xi,f3)-fiX„P)f, 

i=l 

n 

1=1 

n 

F4 = -2Y,{f{X,,l3) - f{Xi,f3))(Yi - Y^, 



1=1 



1=1 



Note that /(X„/3) - /(X„/3) = " ^u) ^ with X* being a 



F, = -2J2ifiX^,P)- fiXi,(3))ei. 

point between Xi and Xi. Condition (A7) and (30) ensure that 
Fi = Op i^{h^ + n-^l^h-^ \ognf Y^^ 

F.^o.((.^.n-v^.-iog.)^{|:(|:^)^; 



:op(nV2). 



.i=l 
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Arguing as for Fi and F2, we can easily obtain F3 = op{n) and = op{n^^^). 
For F5, given the set S = {{Xi, Ui),l <i <n}, we can derive that 

F5 = ^(^5 1 5) + Op (Var(F5 1 5)) 
= Op (Var(F5 1 5)) 



Opij2{f{X^,(3)-f{X„P)y 



\i=l 



Proof of (ii). Denote the estimating function vector {R\{(3), . . . , RP{P)y 
by RniP) and {Gi{(3) , . . . , GP{l3)y by G„(/3). From the mean-value theo- 
rem, there exists a (3* such that 



(31) 



9/3 



where the {s,k) element of matrix ^^^j^ ^ is 



dp 



{s,k) 



^ df{X,,(3*)df{X,,(3*) , ^^^y ^ dy{X,,(3n 

^ dP^s + Z^y^i- J'^^i^f^ ))' 



i=l 



1 



i=l 



+ Y^{%- f{X,,l3*))Y,{Xu- Xu) 



i=l 



1=1 



dPs dl3k dxi 



and (3* lies between /3 and X^^ between Xu and Xu. We want to show 
that 



(32) sup 
/3 



I dGM , 1 " 9/(X„/3)9/(X„/3) 

n 5/3 3/3, 



op(l). 



It suffices to prove that 



sup 
/3 



1 " df{Xi,f5) df{Xi,p) 1 " a/(x,,/3) df{x„f3) 

nfr[ df3s dl3k n f-; 5/3, df3k 



sup 

/3 



sup 

13 



0P(1), 
0P(1), 
Op(l). 
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Invoking the regularity condition (Al) on / and mimicking the proof of Lem- 
ma A.l, the second and third equation can be easily proved. There exist two 
points and X** , so that the left-hand side of the first equation is bounded 
by 

" " d^f{X*,(3)df{X,,f3)^ 



sup 

/3 



2 y 



d(3k 



- {Xu — Xu 



+ sup 



-|- sup 

/3 



n q 



df{x„p) d'f{xr,(3) 



d(3k dxi 



{Xii — Xii] 



n 



fi dPs dxi 



^fr[ dPk dxi 

Again, by condition (Al) and Lemma A.l, every term above is of order 
op(l). Recalling the definition of A provided in (11) and combining this 
with (32) and the consistency of /3, we have 

1 OGnifB* 



(33) 



A + op(l). 



n d(3 

From Proposition 1, Gn{l3) = Rn{l3) + op{n^^'^). Then, (31) and (33) en- 



V^iP - /30) = (A + op{l)y^^Gn{P°) 

= (A + op{l)r' f ^i?n(/3°) + op(i; 



sure that 



(34) 



which entails the result. □ 

Proof of Theorem 2. With Xq = 1 and 4>o{u) = 1, we can easily see, 
first, that o"^ — (t| > is equivalent to 

(T (A )kk + 7=^fikl{k = 0) -/3fcE 



A-i^i?„(/30) + op(l), 
Jn 



E[y]' 

Y -Y Afe-Afe^2 

'W\ nxk] 



E[A,.] 



< a\K-%k + cy\^r%k Var[^(i7)] + fil 



■Var[V'(C/)-(/>fc(t/)]. 
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After some algebraic calculations, it is also equivalent to 



<<T2(A-')nVar[i('(t/)], 

Then, multiplying the two sides of the above inequality by (E[y])^, and 
noticing that E[y2] = /3^A/3 + a"^ ^ (E[y])2 = /^^AqAJ/?, and E[yA:fc]E[y] = 
/3^(^Afc X Ag + ^AoAp/3, we can obtain that cr^ — cr| > is equivalent to 

(35) /3^Cfc/3<0, 0<A:<p, 

where Cfc is given as 



(-3 - m^\U)] + 8E[VXC/)./'fe(t/)])7^i7^/5f 



-4cT2Var[V'(C/)](A-i)fe,jAoA5 

+ a2(-3 + 4E[V^2^;7)])efce^ + 2a\ekkl + Akel)I{k = 0) 

+ (3 - 4E[V^(C/),^fc(C/)])^/32(AfcA5 + AoA^) 

+ (-3 + 4E[V'2(C/)])/32A. 

Clearly, the matrices C^s are symmetric. In the following, we verify that 
the set about (3 satisfying the above p+1 inequalities is not empty. Because 
Cfc involves the parameter (3k, (35) is, indeed, not a standard quadric form. 
Instead, we can investigate the equivalent standard one, which is 

(3lCk(3k<0, 0<k<p 

with f3f. = (3/ fik- Pk varies with k, but their directions are the same. This 
means that it suffices to consider (35). To prove (35) nonempty, we verify 
that there exists at least one negative eigenvalue of matrices C^s. For any n x 
n matrix, denote by A^ax 

(•) and Amin(-) its largest and smallest eigenvalues, 
respectively. To prove this, we only need to prove that 

(36) Arnm(Cfc)<0, 0<k<p. 
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Note that, for any given symmetric matrix B, it always holds that Amin(-B) < 
Bii < A 

max 

{B). So, we obtain that, 

Amin(Cfc) < (Cfc)oo 

= Pi (i - (1^) mi^'m - 8nm)Mu)] + 3) 

- 4cT2(A-i)fcfc Var[V'(f/)] + ^2(4 Var[^(;7)] + 5)/(A; = 0). 
If the distorting functions ip{u) and (j)k{u) satisfy 

iEliP^U)] - 8E[V(C/)0fc(C/)] + 3 > 

and 

Var[V'(t/)] > ^ 



4((A-i)oo-l)^ 



then (Cfc)oo < 0, and then AminlCfc) < 0. (36) is proved; that is, the set 
confined by the conditions of Theorem 2 is not empty. □ 

Proof of Theorem 3. By some elementary calculations, we can have 



G„,,(/3) = (y,-/(X„/3))- 



-(/(X„/3)-/(X„/3)) 
(37) + {Yi - Yi 



dl3 



d(3 



+ {Yi-f{X„P)) 



dp 

df{X.,P) df{X„(3) 



8(3 8(3 



= (y, - /(X„ /3))^:&^ - + + E,,. 



We now show that 



(38) max |G„,i(/3)|=op(n^/^). 

l<j<n 

It is well known that, for any sequence of independent and identically dis- 
tributed random variables {Zi,l < i < n} with ]E[Zj^] < oo, 

max |Zj|/-v/n — > 0, almost surely. 

l<i<n 

This implies that maxi<j<„ \ {Yi — f{Xi, (3)) ^'^^gg'^'^ \ = op{n^l'^), because ^[(1^ ■ 



a/3 

f{Xi,(3)) ^-^^g^'^^ ]'^ < oo. Using Taylor expansion to the quantity f{Xi,f3) 
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f{Xi,f3), with respect to X and by condition (Al), maxi<j<„ |£'ji| = op(n^/^) 
fohows from 

l-c/iil s — — — [^li — 



1=1 



1=1 

Q 



dxi 

" ^gi{Ui)p{Ui) Xi 



gl{U^)p{U^)nXl] 



dp 



l]X, 



dfiX^P) 



dp 



+ 



< 



gi{Ui)pm 
m{Ui)p{Ui) 

gi{u)p{u) 



Xi-nxi] ^Xi-nxi] 



sup 



gi{u)p{u) 
+ Op{n-^/^) sup 



E[Xi_ 



gi{u)p{u) 



gi{u)p{u) 

Q 

Y,{Op{h^ + logn) + Op{n-^/^)} 



nxi] 



+ Op(n-V2; 



Xt 



df{X,,l3) 



dp 



X, 



dfiX^,f3) 



dp 



1=1 



Xii 



df{X„P) 



dp 



Applying similar techniques to those used in the analysis of En, we have 
maxi<i<„|£'j2| =op(n^/2) and maxi<j<„ l^jsl =op(nV2). 

Further, applying the results of Proposition 1 and following the same 
arguments as were used in the proof of expression (2.14) in Owen (1990), 
we have 



(39) 



A = 0p(n-i/2). 



Applying Taylor expansion to (16), and invoking (38), (39) and (10) of 
Proposition 1, we can verify that 

(40) f(/3) = 2X:(a^G'„,,(/3) - liX^GnAP)}') +op{l). 

By (17), it follows that 

)_ 



^ 1 " GnAP) 



1=1 1=1 1=1 "■ 
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The application of (38), (39) and (10) of Proposition 1 yields 

/ n \ -1 n 

(41) A = GnAf3)Gl,{f3) GnAP) + op{n-^'' 

\i=l / i=l 

Equation (40), together with (41), obtains the decomposition 



(42) 

X 



n ' / V n . 

^ 1=1 / \ 1 = 1 



lY.GnA(3)j +0P{1). 



From Proposition 1 and (42), we have 

(43) = RAP){Rn{f3)Km-'Rn{(3) + op{l). 

Since Rnif3) is a sum of independent and identically distributed random 
variables, the result of Theorem 3 is immediately achieved by central limit 
theorems. □ 
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